NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization

Majumder, Bodhisattwa_Prasad; Dalvi_Mishra, Bhavana; Jansen, Peter; Tafjord, Oyvind; Tandon, Niket; Zhang, Li; Callison-Burch, Chris; Clark, Peter (October 2024, Proceedings of the Conference on Language Modeling (COLM))

Language agents have shown some ability to interact with an external environment, e.g., a virtual world such as ScienceWorld, to perform complex tasks, e.g., growing a plant, without the startup costs of reinforcement learning. While recent work, e.g., Reflexion, has demonstrated how such agents can also self-improve by adding a textual memory of ''hints'' learned from prior experience, such improvements have been limited both in size and scope. In contrast, our goal is a language agent that can robustly improve performance over time, including when both the task and environment are varied. Our approach is to have the agent learn a textual representation of how the world works (rather than just isolated hints), expressed as a memory of causal abstractions, to guide future decision-making. In experiments, we find CLIN is able to continually improve on repeated trials on the same task and environment, outperforming state-of-the-art reflective language agents like Reflexion by 23 points in ScienceWorld and 1.4 points in ALFWorld benchmarks. CLIN can also transfer its learning to new environments and tasks, enhancing performance by 21 points in ScienceWorld and 11 points in ALFWorld
more » « less
Full Text Available
PROC2PDDL: Open-Domain Planning Representations from Texts

https://doi.org/10.18653/v1/2024.nlrse-1.2

Zhang, Tianyi; Zhang, Li; Hou, Zhaoyi; Wang, Ziyu; Gu, Yuling; Clark, Peter; Callison-Burch, Chris; Tandon, Niket (January 2024, Association for Computational Linguistics)

Planning in a text-based environment continues to be a significant challenge for AI systems. Recent approaches have utilized language models to predict planning domain definitions (e.g., PDDL) but have only been evaluated in closed-domain simulated environments. To address this, we present Proc2PDDL, the first dataset containing open-domain procedural texts paired with expert-annotated PDDL representations. Using this dataset, we evaluate the task of predicting domain actions (parameters, preconditions, and effects). We experiment with various large language models (LLMs) and prompting mechanisms, including a novel instruction inspired by the zone of proximal development (ZPD), which reconstructs the task as incremental basic skills. Our results demonstrate that Proc2PDDL is highly challenging for end-to-end LLMs, with GPT-3.5’s success rate close to 0% and GPT-4o’s 38%. With ZPD instructions, GPT-4o’s success rate increases to 45%, outperforming regular chain-of-thought prompting’s 34%. Our analysis systematically examines both syntactic and semantic errors, providing insights into the strengths and weaknesses of language models in generating domain-specific programs.
more » « less
Full Text Available
Analyzing the Contribution of Commonsense Knowledge Sources for Why-Question Answering

Lal, Yash Kumar; Liu, Horace; Tandon, Niket; Chambers, Nathanael; Mooney, Ray; Balasubramanian, Niranjan (May 2022, ACL 2022 Workshop on Commonsense Representation and Reasoning)

Full Text Available
Using Commonsense Knowledge to Answer Why-Questions

https://doi.org/10.18653/v1/2022.emnlp-main.79

Lal, Yash Kumar; Tandon, Niket; Aggarwal, Tanvi; Liu, Horace; Chambers, Nathanael; Mooney, Raymond; Balasubramanian, Niranjan (January 2022, Empirical Methods in Natural Language Processing)

Full Text Available

Search for: All records